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Abstract 

CXCL1 2-CXCR4-CXCR7 signaling promotes tumor growth and metastasis in breast cancer. Alternative splicing of 
CXCL12 produces isoforms with distinct structural and biochemical properties, but little is known about isoform- 
specific differences in breast cancer subtypes and patient outcomes. We investigated global expression profiles of 
the six CXCL12 isoforms, CXCR4, and CXCR7 in The Cancer Genome Atlas breast cancer cohort using next- 
generation RNA sequencing in 948 breast cancer and benign samples and seven breast cancer cell lines. We 
compared expression levels with several clinical parameters, as well as metastasis, recurrence, and overall survival 
(OS). CXCL12-Q -/?, and -y are highly co-expressed, with low expression correlating with more aggressive 
subtypes, higher stage disease, and worse clinical outcomes. CXCL 12-5 did not correlate with other isoforms but 
was prognostic for OS and showed the same trend for metastasis and recurrence-free survival. Effects of CXCL12-5 
remained independently prognostic when taking into account expression of CXCL12, CXCR4, and CXCR7. These 
results were also reflected when comparing CXCL12-a, -/?, and -yln breast cancer cell lines. We summarized 
expression of all CXCL12 isoforms in an important chemokine signaling pathway in breast cancer in a large clinical 
cohort and common breast cancer cell lines, establishing differences among isoforms in multiple clinical, 
pathologic, and molecular subgroups. We identified for the first time the clinical importance of a previously 
unstudied isoform, CXCL12-5. 

Translational Oncology (2014) 7, 429-438 



Introduction 

Nearly all human genes undergo alternative splicing, substantially 
increasing diversity in protein structure and function [1], Genome- 
wide analyses of several different cancers demonstrate extensive 
perturbations in splicing during tumor initiation and metastasis [2,3]. 
Alternatively spliced proteins regulate fundamental processes in 
cancer, including apoptosis, metabolism, and metastasis, suggesting 
that dysregulated splicing is critical to malignancy [4-6]. As 
prominent examples of alternative splicing in cancer, a switch from 
pyruvate kinase M 1 to the M2 isoform drives anabolic metabolism in 
malignant cells, and a novel splice variant of the transmembrane 
protein CD44 promotes metastasis [5,7-9]. Isoforms of these and 
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other genes preferentially expressed in malignant versus normal tissues 
provide potential biomarkers for detection of cancer and may 
contribute to drug resistance of cancer cells. Identifying changes in 
protein isoform expression in cancer will improve understanding of 
key signaling pathways in tumorigenesis and point to novel 
therapeutic targets to improve cancer therapy [10,11]. 

Chemokine CXCL12 and its chemokine receptors CXCR4 and 
CXCR7 (recently renamed as ACKR3) comprise a signaling axis 
strongly linked to tumor growth and metastasis in breast cancer and 
more than 20 other malignancies [12,13]. CXCL1 2 binding to CXCR4 
activates pathways including phosphatidylinositol-3 kinase and mito- 
gen-activated protein kinases to promote growth, survival, and 
chemotaxis of breast cancer cells. High levels of CXCL12 are expressed 
in common sites of breast cancer metastasis such as lung, liver, bone, 
and brain [14]. CXCR4 commonly is upregulated on breast cancer cells, 
and numerous studies have demonstrated both gene and protein 
overexpression of CXCR4 on cancer cells in primary breast tumors [15- 
18]. The anatomic distribution of CXCL12 and studies in mouse 
models of cancer suggest that gradients of this chemokine drive local 
invasion and subsequent homing of CXCR4+ breast cancer cells to 
secondary sites [18,19]. CXCR7 also is expressed by breast cancer cells 
and stromal cells, such as endothelium on tumor vasculature, in primary 
breast cancers [20]. CXCR7 functions as a scavenger receptor for 
CXCL12, functioning in part to decrease amounts of this chemokine in 
the extracellular space and establish chemotactic gradients [21,22]. 
CXCR7 also promotes survival and invasion of malignant cells [23]. 

Although six different isoforms of human CXCL12 (a, [3, 7, 5, E, 
and (p) have been described, most studies of CXCL12 focus only on 
the ot isoform or do not distinguish among isoforms [24]. CXCL12 
may be secreted by malignant cells in primary breast cancers in 
addition to carcinoma-associated fibroblasts and/or mesenchymal 
stem cells in the tumor microenvironment [17,25,26]. Fibroblasts 
isolated from primary breast tumors secrete CXCL12 at higher levels 
than fibroblasts from normal mammary tissue despite no genetic 
mutations in stroma [27,28] . These findings suggest that cancer cells 
stimulate adjacent fibroblasts to produce higher levels of total 
CXCL12 in breast tumors than normal mammary tissue [28]. 
However, while these data demonstrate that carcinoma-associated 
fibroblasts are characterized by increased CXCL12, it remains unclear 
to what extent total CXCL12 in breast cancers differs from normal 
tissue and affects prognosis. On the basis of expression of CXCL12-a 
and -/? in two different breast cancer microarray data sets and 
immunohistochemistry (IHC) of primary breast tumors, Mirisola and 
colleagues reported that higher expression levels of CXCL12-a and -/? 
correlate with better disease-free survival [29]. However, a separate 
high throughput analysis of CXCL12 expression concluded that 
higher CXCL12 levels correlate with increased metastasis and local 
recurrence in breast cancer [17]. Determining effects of high versus 
low CXCL12 on prognosis and disease progression in breast cancer is 
essential to direct optimal use of therapeutic antibodies and other 
agents being developed for CXCL12-targeted cancer therapy [30]. 

Prior genetic analyses of mRNA for CXCL12 isoforms have used 
microarrays, which frequently lack probes to detect specific isoforms 
of these genes. However, next-generation sequencing overcomes this 
limitation. Using bioinformatics analysis of publicly available data sets 
from The Cancer Genome Atlas (TCGA), we investigated expression 
of CXCL12 isoforms, as well as CXCR4 and CXCR7 in breast cancer. 
We then correlated patterns of expression with important molecular 
phenotypes, clinical parameters, and outcomes in these patients. 



These analyses revealed distinct differences in expression for various 
isoforms of these genes. We show that low levels of expression of 
CXCL12 correlate with worse prognosis in breast cancer with 
isoform-specific differences among a, P, "y, and 8 isoforms. These 
data demonstrate the impact of CXCL12 isoforms in breast cancer 
and underscore the need to better understand functional differences 
among these molecules in disease progression and therapy. 

Methods 

Study Design and RNA Sequencing 

Publicly available RNA next-generation sequencing and clinical 
data (844 breast cancer and 104 benign breast samples) were retrieved 
from TCGA for breast cancer [31]. Additional clinical data such as 
PAM50 clustering and clinical follow-up for the TCGA were 
obtained from the UCSC Cancer Genomics Browser [32]. RNA 
sequencing data for seven breast cancer cell lines (two samples each) 
were obtained from the Illumina iDEA database (www.illumina. 
com). Three of these cell lines have been shown to have metastatic 
potential (BT20, MDA-MB-231, and MDA-MB-468), and four cell 
lines have been shown to have no metastatic potential (BT474, 
MCF7, T47D, and ZR-75-1) [33-35]. RNA sequencing reads were 
aligned to the genome with Tophat [36] using Genome Reference 
Consortium Human Build 37 (GRCh37 or hgl9) (www.ncbi.nlm. 
nih.gov) as the reference genome. 

Seven hundred eighty-five of the cancer samples had clinical data 
from TCGA, and 832 had data from UCSC Cancer Genome 
Browser. Her2 status was not included as a column, so we calculated 
it based on the IHC data column. For cases with equivocal IHC, we 
then used the in situ hybridization data column. 

Normalization was performed using Fragments per Kilobase per 
Million, and isoform expression values were generated using Cufflinks 
with Ensembl version 69 as the reference transcriptome [37]. 
Cufflinks calculates isoform expression levels using a statistical model 
in which the probability of observing a given fragment is a linear 
function of the transcript abundance. Gene level expression is the sum 
of transcript level expression, as each read is assigned to a single 
transcript. Tophat was chosen because it is the standard sequence 
aligner used by Cufflinks [38]. 

Statistical and Bioinformatics Analysis 

Correlation coefficients were generated using Spearman's correla- 
tion. Hierarchical clustering was performed on the covariance 
matrices to generate heat maps. Expression levels of the isoforms 
and at the gene level were compared across clinical and pathologic 
groups such as cancer versus normal, tumor stage, histology, 
hormone receptor status, and PAM50 cluster [39]. Means between 
groups were compared using analysis of variance. Expression was 
divided into high versus low expression using the median expression 
value. Kaplan-Meier curves were generated for the high and low 
expression groups and compared using the log-rank test for 
metastasis-free survival (MFS), recurrence-free survival (RFS), and 
overall survival (OS). Hazard ratios (HRs) were generated using 
univariate Cox regression. Multi-gene analysis was performed using 
Cox regression with expression of each gene/isoform as a covariate. 
Comparison of expression between metastatic versus non-metastatic 
cell lines was performed using Student's £-test. Statistics and plots 
were generated using the R statistical computing software and 
GraphPad Prism. 
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Results 

Differential Expression of CXCL12 Isoforms, CXCR4, and 
CXCR7 in Breast Cancer versus Normal Breast Tissue 

Studies of isoforms of CXCL12 in cancer and other diseases have 
been limited by the lack of isoform-specific probes on microarrays and 
antibodies for IHC. As a result, studies have focused predominantly 
on only the a and P isoforms of CXCL12. To overcome limitations of 
microarrays and antibodies, we investigated expression levels of all 
isoforms of CXCL12 and receptors CXCR4 and CXCR7 in breast 
cancer using the TCGA RNA sequencing data set. The clinical and 
pathologic characteristics of the tumor samples and patients in this 
data set are shown in Table 1 . 

The Cufflinks analysis program assigns each read to individual 
isoforms such that the sum of expression levels for a specific isoform is 
equal to the gene level of expression. On the basis of this analysis, we 
determined that the most common isoform of CXCL12 in breast 
cancer is a (65%), followed by B (27%) > y (5%) > 8 (2%). We 
detected only very low levels of expression for CXCL12-S (0.1%) 
and -<p (0.2%) and therefore refrained from statistical inference using 
these isoforms. 

To establish for the first time correlations among CXCL12 isoforms, 
CXCR4, and CXCR7, we examined the Spearman's covariance matrix 
with hierarchal clustering for breast cancer and normal tissues, 
respectively (Figure 1, A and B). In breast cancer, CXCL12-a and -B 
were highly correlated (correlation coefficient of 0.91), and these two 
isoforms also correlated highly with gene-level expression of CXCL12 
(correlation coefficients of 0.99 and 0.95 for CXCL12-a and -/?, 
respectively; Figure \A). By comparison, the y, e, and (p isoforms of 
CXCL12 correlated moderately with gene-level expression of a and B 
(correlation coefficients of 0.44 to 0.59). Interestingly, the 8 isoform, 
which is not well characterized in the literature, correlated very poorly 



Table 1. Cancer Patient Characteristics Compiled from TCGA. 



Mean Age (Years) 


58.0 ± 13.3 


Median Follow-Up (Months) 


23.7 


Variable (n) 




Variable (n) 




ER 




Node stage 




Positive 


579 


0 


361 


Negative 


170 


1 + 


409 


PR 




Death 




Positive 


505 


Positive 


104 


Negative 


241 


Negative 


6-5 


Her2 recepror 




Recurrence 




Positive 


141 


Positive 


53 


Negative 


508 


Negative 


315 


PAM50 status 




Metastasis 




Basal 


13S 


Positive 


48 


Her2 


66 


Negative 


324 


Luminal A 


414 


Gender 




Luminal B 


190 


Female 


776 


Normal 


24 


Male 


9 


Overall srage 




Menopause 




1 


127 


Pre-menopause 


181 


2 


127 


Post-menopause 


493 


3 


171 


Race 




4 


15 


White 


541 


Tumor srage 




Black or African American 


55 


1 


207 


Asian 


5 1 


2 


467 


American Indian or Alaskan Native 


1 


3 
4 


76 
32 







with the other CXCL12 isoforms (correlation coefficients of -0.11 
to 0.27) and in cancer samples, clustered with CXCR4 and CXCR7 
rather than with the other CXCL12 isoforms. CXCR4 and CXCR7 
displayed a weak positive correlation with gene-level expression of 
CXCL12 and its a and B isoforms but did not correlate with each other. 
These same general correlations were present in normal samples 
(Figure IB). However, in normal samples, CXCR7 tended to correlate 
inversely with CXCR4, and CXCR7 also exhibited modest to strong 
correlations with CXCL12-a and -/? and overall gene-level expression of 
this chemokine. 

We next investigated levels of expression for various chemokine and 
receptor isoforms in cancer and normal tissues. While previous 
publications report discordant results for CXCL12 in breast cancer 
versus normal breast, our analysis showed significant down-regulation of 
CXCL12-a, -B, and -y in cancer (Figure 1 Q. Expression of CXCL12-8 
also decreased in cancer as compared with normal, although differences 
were not significant. Similarly, CXCR7 was downregulated in cancer. 
CXCR4 demonstrated the opposite pattern with up-regulation in 
cancer, consistent with prior literature [15,17,18]. 

Variations among CXCL12 Isoforms, CXCR4, and CXCR7 
with Clinical and Molecular Staging Parameters 

Within cancer samples, CXCL12-a, -B, and - y varied significantly 
with tumor stage (Figure 2A). For these isoforms of CXCL12, lower 
stage tumors had higher levels of expression with the highest 
amounts of each isoform present in stage I primary breast tumors. 
We observed a similar trend for gene-level expression of CXCL12. 
We also compared differences in expression of various isoforms with 
histologic classifications of breast cancer. Invasive ductal and 
invasive lobular carcinomas comprise the majority of the TCGA 
data set, and most of the mixed histology samples contain features of 
both invasive ductal and lobular cancer. Gene-level expression of 
CXCL12, as well as a, B, and y isoforms, showed significant 
variations across different histologic groups (Figure 2B). Amounts of 
total CXCL12 and these three isoforms were highest in invasive 
lobular cancer with a rank order of invasive lobular > mixed > inva- 
sive ductal carcinoma. We note that lowest levels of expression for 
CXCL12 and the a, B, and y isoforms occurred in less common 
histologic types of breast cancer, medullary and mucinous. Other 
isoforms of CXCL12 did not vary significantly with tumor stage or 
histologic type, and we also did not identify significant correlations 
with gene-level CXCR4 or CXCR7. 

In clinical oncology, breast cancers are categorized on the basis of 
hormone receptors [estrogen (ER) and progesterone (PR)] and 
amplification of the oncogene Her2. These categories determine 
prognosis and treatment options [40]. We analyzed expression of 
CXCL12, CXCR4, and CXCR7 and individual isoforms in tumors 
positive for both ER and PR, Her2 only, and all three receptors (triple 
positive), as well as primary cancers lacking expression of these three 
receptors (triple negative). Gene-level expression of CXCL12 and the 
a and ft isoforms each varied significantly across these subtypes with 
highest amounts in ER/PR positive and triple positive cancers 
(Figure ?>A). By comparison, levels of overall CXCL12, CXCL12-a, 
and CXCL12-B decreased in triple negative cancer and to an even 
greater extent in Her2 positive tumors. Other isoforms of CXCL12 
did not vary significantly with receptor status. CXCR7 varied with 
receptor status in a pattern comparable to CXCL12 (Figure ?>A). 
Levels of CXCR7 were highest in ER/PR positive and triple positive 
tumors with lower expression in triple negative and Her2 positive 
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Cancer 



B 



Normal 




Figure 1. Spearman's covariance matrix with hierarchal clustering of CXCL12 isoforms, CXCR4, and CXCR7 for (A) breast cancer and 
(B) normal tissues. (C) Expression levels of CXCL1 2 isoforms, CXCR4, and CXCR7 in breast cancer and normal tissues. Expression levels 
are means ± SEM. *P < .05 and **P < .001 . 



cancers. Interestingly, we identified a distinct pattern of expression for 
CXCR4, which was elevated in triple negative breast cancer relative to 
the other groups [41]. 

More recently, breast cancers have been classified into 
intrinsic molecular subtypes (Normal-like, Luminal A, Luminal 
B, Her2-enriched, and Basal-like) defined by a 50-gene panel 
referred to as PAM50. Intrinsic subtypes add prognostic and 
predictive information to standard metrics used to categorize breast 
cancer. When analyzed across intrinsic subtypes, CXCL12 and its 
a, P, and y isoforms varied significantly (Figure 3B). Expression 
was highest in the Normal-like cluster, which is consistent with 
our data in Figure 1^4 showing up-regulation of these isoforms in 
normal samples. Luminal A had the next highest expression with 
Luminal B, Her2-enriched, and Basal clusters exhibiting lower 
expression. We also identified significant variations of 
receptors with intrinsic subtypes of breast cancer. CXCR4 showed 
differential expression among clusters with lowest levels in Luminal 
A and Luminal B subtypes and highest expression in Basal cancers. 
By comparison, levels of CXCR7 were highest in Luminal A and 
Luminal B subtypes. 



CXCL12 and its a, fi, and y isoforms vary significantly with 
race. We identified higher expression in whites than Asians or 
African-Americans (Figure W1^4). Gene-level CXCL12 and the 
a isoform also changed significantly by age group with levels peaking 
in the 50 to 60 year age group relative to younger or older patients 
(Figure W15). CXCL12-/3 and -y showed a similar pattern across age 
groups, although differences were not significant. We did not identify 
significant correlations for race or age groups for CXCR4 or 
CXCR7. CXCL12 and its a, /?, and y isoforms vary significantly 
with race. We identified higher expression in whites than Asians or 
African-Americans (Figure WL4). Gene-level CXCL12 and the a 
isoform also changed significantly by age group with levels peaking in 
the 50 to 60 year age group relative to younger or older patients 
(Figure W15). CXCL12-fi and -y showed a similar pattern across age 
groups, although differences were not significant. We did not identify 
significant correlations for race or age groups for CXCR4 or CXCR7. 

CXCL12 isoforms correlate with patient outcomes 

We next examined the correlation of gene and isoform expression 
with important clinical outcomes in breast cancer: metastasis, 
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Figure 2. Expression levels of CXCL1 2-a, -P and -y vary significantly 
by (A) tumor and (B) histology. Expression levels are means ± SEM. 
*P< .05 and **P < .001. 



Figure 3. Expression levels of CXCL12 isoforms, CXCR4, and 
CXCR7 vary with (A) hormone receptor status and (B) molecular 
subtype. Expression levels are means ± SEM. *P < .05 and 
**P< .001. 



recurrence, and OS. Kaplan-Meier curves for high versus low 
expression of gene-level CXCL12 demonstrated that low-expression 
corresponded with a significantly worse MFS (P < .008, HR = 2.2) 
but not RFS or OS (Figure 4, A-Q. Similarly, low expression of 
CXCL12-a corresponded with significantly worse MFS (P < .033, 
HR = 1.9) but not RFS or OS (Figure 4, D-F). Unlike CXCL12-a, 
low levels of both CXCL12-/3 and -y correlated with significantly 
worse MFS (P isoform P< .0015, HR = 2.6; 7 isoform P < .011, 
HR = 2.2) and RFS (P isoform P < .028, HR = 2.1; y isoform 
P < .024, HR = 2.1) but not OS (Figure 4, G-L). 

CXCL12-8, the isoform that does not correlate with expression 
patterns of other isoforms in breast cancer or normal breast tissue, had 
a different association with outcomes. Low expression of the 8 
isoform also showed trends for reduced MFS and RFS (Figure 4, M 
and N), although not statistically significant (MFS, P < .16, HR = 
1.5; RFS, P< .077, HR = 1.7). Notably, low CXCL12-S was the 
only CXCL12 isoform correlated with worse OS (P < .0035, HR = 
1.8; Figure 40). 

CXCL12, CXCR4, and CXCR7 do not operate independently but 
as important components in a complex network. We examined the 
expression levels of CXCL12-8, the least understood isoform in the 
context of the expression of the other genes in the pathway. Low 
CXCL12-5 is independently prognostic for OS even after taking 
into account CXCL12, CXCR4, and CXCR7 expression (P < .004, 
HR = 0.56) and shows the same trend in MFS and RFS (Figure 5, 
A—Q) multi-gene analyses. 

CXCL12 Isoforms Correlate with Metastatic Potential in 
Breast Cancer Cell Lines 

By nature, clinical samples such as the TCGA contain a mix of cell 
types, including tumor cells, normal breast tissue, and vasculature, 
making it difficult to identify the cell type(s) producing each transcript. 



To overcome this limitation, we examined RNAseq data in seven breast 
cancer cell lines for CXCL12 isoforms. Surprisingly, we found that 
isoform expression shows a different trend than those in the TCGA 
samples, with y showing the highest expression proportion (42%), 
followed by a (33%) > P (24%). We detected only very low levels of 
expression for CXCL12-8 (0.5%), -s (0.1%) and -<p (0.2%). 

We compared CXCL12 isoform expression levels between cell lines 
with metastatic potential and those without metastatic potential 
(Figure 6) and found that CXCL12 and its a and /? isoforms were 
expressed significantly lower in samples with metastatic potential, 
which is in agreement with the trends of isoform expression in clinical 
samples. The same trend was seen with CXCL12-y, though not 
statistically significant. 

Discussion 

While alternative splicing formerly appeared to be limited to a small 
number of genes, studies now demonstrate that almost all human 
genes undergo alternative splicing to create protein diversity [42]. 
Variants generated by loss of splicing fidelity or regulated transitions 
between isoforms drive cancer and many other human diseases, 
generating a large number of novel transcripts and proteins expressed 
predominantly or uniquely in cancer. For example, up to 36 different 
isoforms of the Wilms tumor gene 1 have been identified with specific 
variants specifically upregulated in acute and chronic myeloid 
leukemias, suggesting key functions in cancer initiation and/or 
progression [43,44]. Similarly, isoforms of vascular endothelial 
growth factor exhibit distinct functional activities in tumor 
angiogenesis that vary on the basis of anatomic site, emphasizing 
the importance of tumor environments on isoforms [45—47]. In 
addition to conferring unique functions to cancer cells and tumor 
environments, alternative splicing offers a rich source of potential 
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Figure 4. MFS, RFS, and OS curves for CXCL1 2 isoforms. Higher levels of gene-level CXCL12 (A-C), CXCL12-a (D-F), -(3(G-I), and -y (J-L) 
generally correlate with improved MFS and RFS. Higher expression of CXCL12-5 (M-O) correlates with better OS. 
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Figure 5. Multi-gene analysis reveals that expression of CXCL12-5 is independently prognostic for (A) MFS, (B) RFS, and (C) OS when 
accounting for gene-level CXCL12, CXCR4, and CXCR7. Error bars represent 95% confidence intervals. 



prognostic and predictive biomarkers. Biomarkers and targeted 
therapies based on alternative splicing may have a higher likelihood 
for success than conventional approaches centered on a whole gene or 
protein. Collectively, these studies highlight the clinical relevance of 
identifying disease-associated changes in alternative splicing. 

Prior research has established central functions of CXCL12 in 
cancer growth and metastasis, but very few studies have investigated 
isoforms of CXCL12 in cancer. In renal cell carcinoma, an analysis 
limited to CXCL12-a and -P revealed that only the P isoform 
correlated with tumor grade and infiltration of CD 8 T cells [48]. 
CXCL12-P also was upregulated in bladder cancer, a disease in 
which expression of this isoform predicted metastasis and disease- 
specific mortality [49]. This study of bladder cancer also showed 
that amounts of CXCL12-a did not change between normal and 
malignant tissues, while CXCL12--y was undetectable. Neither 
these studies nor any others have investigated the other three 
CXCL12 isoforms (8, £, or <p) in cancer due to the lack of antibodies 
against these isoforms and limitations in high throughput 
technology. 

Next-generation sequencing allows our study to fill notable gaps in 
knowledge about the CXCL12/CXCR4/CXCR7 pathway by 
providing the first characterization of expression levels of all known 
alternative splicing variants of CXCL12 in breast cancer or any other 
malignancy. We found that primary human breast cancers express 




Figure 6. Expression of gene-level CXCL1 2 and those of CXCL1 2-a 
and -(3 isoforms are significantly higher in cell lines without 
metastatic potential compared to cell lines with metastatic 
potential. Expression levels are means ± SEM. *P < .05. 



four different isoforms of CXCL12 in rank order of a > P > 7 > 5, 
while £ and (p essentially were undetectable in the TCGA breast 
cancer samples. Expression of CXCL12 isoforms varied significantly 
across many different clinical and molecular categories of breast 
cancer, including stage, histologic type, intrinsic molecular subtype, 
and hormone receptor status. Changes in abundance of transcripts 
typically occurred in parallel for each CXCL12 isoform as would be 
expected for an mRNA regulated by the same common promoter 
elements. We also discovered lower levels of CXCL12 transcripts in 
subtypes of breast cancer regarded as more aggressive, such as triple 
negative and Her2 amplified, and with progression to higher stage. 
These findings corresponded with Kaplan-Meier analyses, where low 
expression of CXCL12 and specific isoforms was associated with 
worse outcomes. We identified isoform-specific effects on MFS, RFS, 
and OS, with low levels of CXCL12-a, -p and -y significantly 
correlated with worse MFS and RFS. Most notably, we note that low 
levels of CXCL12-S associated with worse OS and showed the same 
trend for RFS and MFS, despite the fact that CXCL12-8 expression 
does not correlate with expression of the other isoforms. This 
relationship is robust and persists even after taking into account 
CXCL12, CXCR4, and CXCR7 expression in multi-gene analysis, 
indicating the independent prognostic significance of CXCL12-8. 
These data provide the first evidence that CXCL12-8 is expressed in 
human cancer and correlate with a patient outcome. 

Expression levels of CXCL12 in breast cancer cell lines generally 
mirror conclusions from the clinical samples that lower levels of 
CXCL12 correlate with worse prognosis. We found that breast 
cancer cell lines without metastatic potential (in mouse models) had 
higher levels of CXCL12 expression than cell lines that metastasize 
more widely. Studies of CXCL12 in breast cancer focus on secretion 
of this chemokine by stromal cells in primary and metastatic sites, 
frequently overlooking effects of CXCL12 produced by cancer cells. 
However, epigenetic silencing of the CXCL12 promoter has been 
reported in breast cancer cells with greater metastatic potential, and 
re-expressing CXCL12 limits metastatic disease in mouse xenograft 
models [25]. Our analysis of cell lines may inform likely sources of 
various CXCL12 isoforms in tumor microenvironments. Breast 
cancer cells express CXCL12-a, -/?, and -y with very minimal 
expression of 8, which could indicate that stromal cells are the 
predominant source of the 8 isoform in primary breast cancers. We 
also note that CXCL12-y is higher than a and /? in our panel of 
breast cancer cell lines, which is opposite the pattern in primary 
tumors. Differences between data from cell lines versus tumors may 
reflect dynamic regulation of CXCL12 isoforms in vivo, greater 
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contributions of stromal cells to overall expression of CXCL12-a 
and -P in breast tumors, or simply genomic changes as the original 
cancer samples were transformed into immortalized cell lines. In 
addition, CXCL12 levels within the tumor microenvironment may 
be affected by posttranslational modification, such as cleavage by 
CD26 or matrix-metalloproteinase-2 [50,51]. 

Isoform-specific differences in expression and breast cancer 
outcomes suggest distinct functions of individual splice variants of 
CXCL12 on disease progression. Recent studies have begun to 
identify unique biochemical properties of CXCL12 isoforms, 
particularly a, P, and 7. While all isoforms share the same core 
structure, CXCL12-P, -7, -8, -£, and -(p differ by inclusion of exons 
that add 4, 40, 51, 1, or 11 additional amino acids, respectively, to 
the carboxy terminus of the molecule [24]. Particularly for 
CXCL12-7, the added carboxy-terminal amino acids are enriched 
with basic residues that enhance binding to heparan sulfates and 
other negatively charged extracellular matrix molecules [52]. By 
comparison, CXCL12-P and, to a greater extent, -7 have reduced 
binding affinities for receptors CXCR4 and CXCR7. Biochemical 
differences in binding to receptors and extracellular matrix 
molecules translate to different functional outcomes. In mouse 
models, CXCL12-7 promotes chemotaxis of immune cells and 
endothelial progenitors to a significantly greater extent than other 
isoforms [53,54], Greater binding to heparan sulfates and 
extracellular matrix molecules also limits proteolytic degradation 
of CXCL12 [55]. These studies highlight functional differences 
among CXCL12 isoforms in receptor binding, chemotaxis, and 
stability that could alter outcomes in breast cancer. Our data also 
support further studies analyzing functional differences among 
CXCL12 isoforms, especially for CXCL12-8. 

Correlation between gene transcript data and protein expression is 
dependent on the gene and tissue type. However, mRNA expression 
is generally a good proxy for protein expression and is frequently used 
as biomarkers. [56-58] Gene expression also forms the basis of the 
PAM50 molecular subtyping of breast cancer as well as Oncotype Dx, 
a widely used predictive model for chemotherapy response in breast 
cancer. [59-62] Specifically for CXCL12-a, -P, and -7, mRNA levels 
as measured by quantitative reverse transcription-polymerase chain 
reaction correlate with protein levels as measured by ELISA. [63] We 
also recognize that this study has limitations based on the data 
publicly available through the TCGA. While the data set contains 
transcript data for a large number of patients, the median follow-up 
time is relatively short, and therefore, the number of metastasis and 
recurrence events is small, thus limiting our statistical power. This 
likely accounts for why the //"values for CXCL12-8 MFS and RFS do 
not reach significance. We also do not know the full treatment history 
for all patients, such as exact chemotherapy and radiation regimens, 
and there is likely significant heterogeneity in treatments given the 
multi-institutional nature of the data. Even with these limitations, we 
were able to identify significant differences in outcomes for isoforms 
of CXCL12. 

In summary, our data reveal new associations of CXCL12, CXCR4, 
and CXCR7 gene expression with molecular, histologic, and clinical 
categories of human breast cancer. In addition, we have identified 
isoform-specific differences in CXCL12 for outcomes in breast 
cancer, suggesting distinct biochemical functions of isoforms in 
disease progression. These compelling results establish the foundation 
for mechanistic preclinical studies of these isoforms in breast cancer. 
Additional studies are also warranted to elucidate the biologic and 



functional differences between the CXCL12 isoforms and validate 
them as potential biomarkers. 

Supplementary data to this article can be found online at http://dx. 
doi.org/10.1016/j.tranon.20l4.04.001. 
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